NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Blessings and Curses of Covariate Shifts: Adversarial Learning Dynamics, Directional Convergence, and Equilibria

Liang, Tengyuan (June 2024, Journal of machine learning research)

Covariate distribution shifts and adversarial perturbations present robustness challenges to the conventional statistical learning framework: mild shifts in the test covariate distribution can significantly affect the performance of the statistical model learned based on the training distribution. The model performance typically deteriorates when extrapolation happens: namely, covariates shift to a region where the training distribution is scarce, and naturally, the learned model has little information. For robustness and regularization considerations, adversarial perturbation techniques are proposed as a remedy; however, careful study needs to be carried out about what extrapolation region adversarial covariate shift will focus on, given a learned model. This paper precisely characterizes the extrapolation region, examining both regression and classification in an infinite-dimensional setting. We study the implications of adversarial covariate shifts to subsequent learning of the equilibrium—the Bayes optimal model—in a sequential game framework. We exploit the dynamics of the adversarial learning game and reveal the curious effects of the covariate shift to equilibrium learning and experimental design. In particular, we establish two directional convergence results that exhibit distinctive phenomena: (1) a blessing in regression, the adversarial covariate shifts in an exponential rate to an optimal experimental design for rapid subsequent learning; (2) a curse in classification, the adversarial covariate shifts in a subquadratic rate to the hardest experimental design trapping subsequent learning.
more » « less
Full Text Available
Reversible Gromov–Monge Sampler for Simulation-Based Inference

https://doi.org/10.1137/23M1550384

Hur, YoonHaeng; Guo, Wenxuan; Liang, Tengyuan (June 2024, SIAM Journal on Mathematics of Data Science)

Full Text Available
Detecting Weak Distribution Shifts via Displacement Interpolation

https://doi.org/10.1080/07350015.2024.2335957

Hur, YoonHaeng; Liang, Tengyuan (May 2024, Journal of Business & Economic Statistics)

Full Text Available
High-dimensional asymptotics of Langevin dynamics in spiked matrix models

https://doi.org/10.1093/imaiai/iaad042

Liang, Tengyuan; Sen, Subhabrata; Sur, Pragya (October 2023, Information and Inference: A Journal of the IMA)

Abstract We study Langevin dynamics for recovering the planted signal in the spiked matrix model. We provide a ‘path-wise’ characterization of the overlap between the output of the Langevin algorithm and the planted signal. This overlap is characterized in terms of a self-consistent system of integro-differential equations, usually referred to as the Crisanti–Horner–Sommers–Cugliandolo–Kurchan equations in the spin glass literature. As a second contribution, we derive an explicit formula for the limiting overlap in terms of the signal-to-noise ratio and the injected noise in the diffusion. This uncovers a sharp phase transition—in one regime, the limiting overlap is strictly positive, while in the other, the injected noise overcomes the signal, and the limiting overlap is zero.
more » « less
Interpolating Classifiers Make Few Mistakes

Liang, Tengyuan; Recht, Benjamin (January 2023, Journal of machine learning research)

Full Text Available
Universal Prediction Band via Semi-Definite Programming

https://doi.org/10.1111/rssb.12542

Liang, Tengyuan (August 2022, Journal of the Royal Statistical Society Series B: Statistical Methodology)

Abstract We propose a computationally efficient method to construct nonparametric, heteroscedastic prediction bands for uncertainty quantification, with or without any user-specified predictive model. Our approach provides an alternative to the now-standard conformal prediction for uncertainty quantification, with novel theoretical insights and computational advantages. The data-adaptive prediction band is universally applicable with minimal distributional assumptions, has strong non-asymptotic coverage properties, and is easy to implement using standard convex programs. Our approach can be viewed as a novel variance interpolation with confidence and further leverages techniques from semi-definite programming and sum-of-squares optimization. Theoretical and numerical performances for the proposed approach for uncertainty quantification are analysed.
more » « less
Full Text Available
A precise high-dimensional asymptotic theory for boosting and minimum-ℓ1-norm interpolated classifiers

https://doi.org/10.1214/22-AOS2170

Liang, Tengyuan; Sur, Pragya (June 2022, The Annals of Statistics)

Full Text Available
Online Learning to Transport via the Minimal Selection Principle

Guo, Wenxuan; Hur, YoonHaeng; Liang, Tengyuan; Ryan, Christopher T (July 2022, Proceedings of Machine Learning Research)

Motivated by robust dynamic resource allocation in operations research, we study the Online Learning to Transport (OLT) problem where the decision variable is a probability measure, an infinite-dimensional object. We draw connections between online learning, optimal transport, and partial differential equations through an insight called the minimal selection principle, originally studied in the Wasserstein gradient flow setting by Ambrosio et al. (2005). This allows us to extend the standard online learning framework to the infinite-dimensional setting seamlessly. Based on our framework, we derive a novel method called the minimal selection or exploration (MSoE) algorithm to solve OLT problems using mean-field approximation and discretization techniques. In the displacement convex setting, the main theoretical message underpinning our approach is that minimizing transport cost over time (via the minimal selection principle) ensures optimal cumulative regret upper bounds. On the algorithmic side, our MSoE algorithm applies beyond the displacement convex setting, making the mathematical theory of optimal transport practically relevant to non-convex settings common in dynamic resource allocation.
more » « less
Full Text Available
Mehler’s Formula, Branching Process, and Compositional Kernels of Deep Neural Networks

https://doi.org/10.1080/01621459.2020.1853547

Liang, Tengyuan; Tran-Bach, Hai (January 2022, Journal of the American Statistical Association)

Full Text Available
How Well Generative Adversarial Networks Learn Distributions

Liang, Tengyuan (September 2021, Journal of machine learning research)

Full Text Available

« Prev Next »

Search for: All records